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Amendment in Reply to Final Office Action of July 25, 2005 

IN THE CLAIMS 
Please amend claim 1 and 18 as follows : 

■ 

1 1. (Currently Amended) A speech recognition system comprising 

2 an acoustic detector for detecting speech utterances of a speaker; 

3 a visual detector for detecting at least one facial characteristic 

4 associated with speech utterances of the speaker; a processing 

5 arrangement connected to he responsive to the acoustic and visual 

* 

6 detectors for deriving a signal having first and second values 

7 respectively indicative of the speaker making and not making speech 

8 utterances such that the first value is derived in response to the 

9 acoustic detector detecting a finite, nonzero acoustic response 

10 while the visual detector detects at— l€^a ct on© faci al 

11 Gharacterist - i - c a ss ociated with sp ee ch utteranc es of that the 

12 speaker is facing a predetermined direction ; and a speech 

13 recognizer for deriving an output indicative of the speech 

14 utterances as detected only by the acoustic detector, the speech 

15 recognizer being connected to be responsive to the acoustic 

16 detector in response to the signal having the first value. 

1 2. (Original) The speech recognition system of claim 1 wherein 

2 the processing arrangement causes the signal to have the second 
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3 value in response to any of: (a) the acoustic detector not 

4 detecting a finite, nonzero acoustic response while the visual 

5 detector does not detect speech utterances of the speaker, (b) the 

6 acoustic detector detecting a finite, nonzero acoustic response 

7 while the visual detector does not detect speech utterances of the 

8 speaker, and <c) the acoustic detector not detecting a finite, . 

9 nonzero acoustic response while the visual detector detects speech 

» 

10 utterances of the speaker. 


1 3. (original) The speech recognition system of claim 2 wherein 

2 the processing arrangement includes a delay arrangement for 

3 assuring that the beginning of each speech utterance is coupled to 

4 the speech recognizer. 


1 4 . (Original) The speech recognition system of claim 3 wherein 

2 the delay arrangement includes a memory element connected to be 

3 responsive to the acoustic detector, the memory element including a 

4 plurality of stages for storing sequential segments of the output 

m 

5 of the acoustic detector, the delay arrangement being such that the 

6 contents of the memory element stage storing the beginning of a 

7 speech utterance are initially coupled to the speech recognizer. 
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1 5. (Original) The speech recognition system of claim 4 wherein 

2 the memory element includes a ring buffer. 


1 6. (Original) The speech recognition system of claim 1 wherein 

2 the processing arrangement includes a delay arrangement for 

3 assuring that the beginning of each speech utterance is coupled to 

4 the speech recognizer. 


1 7. (Original) The speech recognition system of claim 6 wherein 

2 the delay arrangement includes a memory element connected to be 

3 responsive to the acoustic detector, the memory element including a 

4 plurality of stages for storing sequential segments of the output 

5 of the acoustic detector, the delay arrangement being such that the 

6 contents of the memory element stage storing the beginning of a 

7 speech utterance are initially coupled to the speech recognizer. 


1 8. (Original) The speech recognition system of claim 7 wherein 

2 the delay arrangement is arranged for assuring that upon the 

3 completion of each speech utterance the acoustic detector is 

+ 

4 decoupled from the speech recognizer. 
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■ 

1 9 . (Previously Presented) The speech recognition system of 

2 claim 1 wherein the processing arrangement includes a delay 

3 arrangement for assuring that in response to completion of each 

4 speech utterance the acoustic detector is decoupled from the speech 

5 recognizer. 

1 10. (Original) The speech recognition system of claim 8 

2 wherein the delay arrangement includes a memory element connected 

3 to be responsive to the acoustic detector, the memory element 

4 including a plurality of stages for storing sequential segments of 

5 the output of the acoustic detector, the delay arrangement being 

6 such that the contents of the memory element stage storing acoustic 

7 energy associated with the acoustic detector and which occurs upon 

8 completion of each speech utterance is prevented from being coupled 

9 to the speech recognizer, 

1 11. (Original) The speech recognition system of claim 9 

2 wherein the memory element includes a ring buffer. 

1 12 . (Original) The speech recognition system of claim 1 

2 wherein the processing arrangement includes a face recognizer 

3 connected to be responsive to the visual detector. 
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1 13, (Previously Presented) The speech recognition system of 

2 claim 12 wherein the face recognizer is arranged for enabling the 

3 signal to have the first value in response to the face of the 

4 speaker being at a predetermined orientation relative to the visual 

5 detector. 

1 14 . {Previously Presented) The speech recognition system of 

2 claim 13 wherein the face recognizer is arranged for: (1) detecting 

3 and distinguishing the faces of a plurality of speakers, and (2} 

4 enabling the signal to have the first value in response to the 

5 speaker having a recognized face. 

1 15 . (Previously Presented) The speech recognition system of 

2 claim 14 wherein the processing arrangement includes a speaker 

3 identity recognizer connected to be responsive to the acoustic 

4 detector, the speaker identity recognizer being arranged for: (1) 

5 detecting and distinguishing speech patterns of a plurality of 

6 speakers, and (2> enabling the signal to have the first value in 

7 response to the speaker having a recognized speech pattern. 
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1 16 . {Previously Presented) The speech recognition system of 

2 claim 15 wherein the processing arrangement is arranged for causing 

3 the signal to have the first value in response to the speaker 

4 having a recognized face matched with a recognized speech pattern 

5 of the same speaker. 

■ 

1 17 . (Previously Presented) The speech recognition system of 

2 claim 1 wherein the processing arrangement includes a face 

3 recognizer connected to be responsive to the visual detector and a 

4 speaker identity recognizer connected to be responsive to the 

5 acoustic detector, the face recognizer being arranged for detecting 

6 and distinguishing the faces of a plurality of speakers, the 

7 speaker identity recognizer being arranged for detecting and 

8 distinguishing speech patterns of a plurality of speakers, the 

9 processing arrangement being arranged for enabling the signal to 

10 have the first value only in response to the speaker having a 

11 recognized face matched with a recognized speech pattern of the 

12 same speaker, 

1 18. (Currently Amended.) A method of recognizing speech 

2 utterances of a speaker with an automatic speech recognizer only 

3 responsive to acoustic speech utterances of the speaker comprising 
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4 detecting acoustic energy having a spectrum associated with speech 

5 utterances, detecting at least one facial characteristic associated 

6 with speech utterances of the speaker, and activating the automatic 

7 speech recognizer in response to the detected acoustic energy 


8 


10 


having a spectrum associated with speech utterances while the a£ 
l e a s t one facial characteristic a s sociated with s p ee ch utt e rance s 
o^— the speaker is occurring facing a predetermined direction . 


1 19 . {Original) The method of claim 18 further comprising 

2 preventing activation of the automatic speech recognizer in 

3 response to any of: (a) no acoustic energy having a spectrum 

4 associated with speech utterances being detected while no facial 

5 characteristic associated with speech utterances of the speaker is 

6 detected, (b) acoustic energy having a spectrum associated with 

7 speech utterances being detected while no facial characteristic 

8 associated with speech utterances of the speaker is detected, and 

9 (c) no acoustic energy havingr a spectrum associated with speech 

10 utterances being detected while at least one facial characteristic 

11 associated with speech utterances of the speaker is detected. 
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1 20 . {Original) The method of claim 18 further comprising 

2 assuring that the beginning of each speech utterance is coupled to 

m 

3 the speech recognizer. 

t * 

1 21. (Original) The method of claim 20 wherein the beginning of 

2 each speech utterance is assuredly coupled to the speech recognizer 

3 by: (a) delaying the speech utterance, (b) recognizing the 

4 beginning of each speech utterance, and (c) responding to the 

5 recognized beginning of each speech utterance to couple the delayed 

6 speech utterance associated with the beginning of each speech 

7 utterance to the speech recognizer and thereafter sequentially 

8 coupling the remaining delayed speech utterances to the speech 

9 recognizer. 

1 22. (Original.) The method of claim 18 further comprising 

2 assuring that no detected acoustic energy is coupled to the speech 

3 recognizer upon the completion of a speech utterance. 

1 23 . (Original) The method of claim 22 wherein assurance that 

2 no detected acoustic energy is coupled to the speech recognizer 

3 upon the completion of a speech utterance is provided by: (a) 

4 delaying the acoustic energy associated with the speech utterance, 
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5 (b) recognizing the completion of each speech utterance, and (c) 

6 responding to the recognized completion of each speech utterance to 

7 decouple delayed acoustic energy occurring after the completion of 

8 each speech utterance from the speech recognizer. 

1 24. (Original) The method of claim 18 wherein the at least one 

2 facial characteristic indicates the face of the speaker has a 

3 predetermined orientation relative to a detector involved in the 

4 step of detecting the at least one facial characteristic. 

1 25 . (Previously Presented) The method of claim 24 further 

2 comprising distinguishing the face of the speaker from a plurality 

3 of speakers, distinguishing the speech pattern of the speaker from 

4 a plurality of speakers, and activating the automatic speech 

5 recognizer in response to the speaker having a recognized face 

6 matched with a recognized speech pattern of the same speaker. 

1 26 . (Previously Presented) The method of claim 18 further 

2 comprising distinguishing the face of the speaker from a plurality 

3 of speakers, distinguishing the speech pattern of the speaker from 

4 a plurality of speakers, and activating the automatic speech 
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5 recognizer in response to the speaker having a recognized face 

6 matched with a recognized speech pattern of the same speaker. 


1 27 . (Previously Presented) The method of claim 26 further 

2 including storing: (1) images of the faces of a plurality of 

3 speakers, and (2) the speech patterns of the same plurality of 

4 speakers during at least one training period; and performing the 

5 distinguishing step by comparing the stored images and speech 

6 patterns with images of the face of the speaker and the speech 

7 pattern of the speaker. 
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